Co - usage of documents in a large digital library ∗ submitted to Fourth Delos Workshop 2002 – 04 – 14
نویسندگان
چکیده
The RePEc Economics library offers the largest distributed source of freely downloadable scientific research reports in the world. WoPEc is a user services of that library. It operates on the Internet since 1993. It has a well-established user community, and a relatively narrow subject coverage. In this paper, we wish to find out which papers in the collection are similar through usage. The idea is that if different users request a couple of papers consistently together, then these papers are likely to correspond to the same information needs. They are similar in this sense. We present a theoretical discussion of these relationships and an empirical assessment. We introduce a measure of co-usage and estimate results for the WoPEc user service. This paper is available online at http://openlib.org/home/krichel/papers/kumegawa.html. However, that version does not contain mathematical expressions and is provided for evaluation purposes only. The full paper is available in PDF for A4 paper, and for letter size paper.) ∗The work discussed here has received financial support by the Joint Information Systems Committee of the UK Higher Education Funding Councils through its Electronic Library Programme.
منابع مشابه
Foundations of a Multidimensional Query Language for Digital Libraries
A query language for Digital Libraries is presented, which offers access to documents by structure and sophisticated usage of metadata. The language is based on a mathematical model of digital library documents, centered around a multilevel representation of documents as versions, views and manifestations. The core of the model is the notion of document view, which is recursive, and captures th...
متن کاملA System Architecture as a Support to a Flexible Annotation Service
Digital Library Management Systems are systems that are able to manage collections of digital documents that form Digital Libraries and Digital Archives, and they are currently in a state of evolution. Today, most of the times they are simply places where information resources can be stored and made available, whereas for tomorrow they are becoming an integrated part of the way the user works. ...
متن کاملCombining Multimedia Retrieval and Text Retrieval to Search Structured Documents in Digital Libraries
Digital Libraries usually contain a large collection of structured multimedia documents. At present text may be dominant in many applications, however the relevance of other media types such as image, audio and video increases steadily. An important functionality of a digital library in this respect is the retrieval of relevant multimedia documents or relevant parts of multimedia documents. To ...
متن کاملThe use of document structure analysis to retrieve information from documents in digital libraries
This paper describes an approach to retrieving information from document images stored in a digital library by means of knowledge-based layout analysis and logical structure derivation techniques. Queries on document image content are categorized in terms of the type of information that is desired (e.g., articles on a given topic), and are parsed to determine the type of document from which inf...
متن کاملREPORT OF THE DELOS-NSF WORKING GROUP ON DIGITAL IMAGERY FOR SIGNIFICANT CULTURAL AND HISTORICAL MATERIALS Co-Chairs
Recent revolutionary breakthroughs in computing and communications with the epoch-making arrival of the Internet have begun to demolish artificial disciplinary boundaries and to open vast new fields of interdisciplinary research. One major area was outlined in the recent report to the US President by the President’s Information Technology Advisory Committee (PITAC), entitled Digital Libraries: ...
متن کامل